Scalable proxy clusters

ABSTRACT

The invention enables high-availability, high-scale, high security and disaster recovery for API computing, including in terms of capture of data traffic passing through proxies, routing communications between clients and servers, and load balancing and/or forwarding functions. The invention inter alia provides (i) a scalable cluster of proxies configured to route communications between clients and servers, without any single point of failure, (ii) proxy nodes configured for implementing the scalable cluster (iii) efficient methods of configuring the proxy cluster, (iv) natural resiliency of clusters and/or proxy nodes within a cluster, (v) methods for scaling of clusters, (vi) configurability of clusters to span multiple servers, multiple racks and multiple datacenters, thereby ensuring high availability and disaster recovery (vii) switching between proxies or between servers without loss of session.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No. 16/050,958, filed Jul. 31, 2018, now U.S. Pat. No. 10,484,337, which is a divisional of U.S. patent application Ser. No. 15/164,512, filed May 25, 2016, now U.S. Pat. No. 10,701,037, which claims the benefit of U.S. Provisional Application Ser. No. 62/167,165, filed May 27, 2015, the disclosures of each of which are hereby incorporated by reference in their entirety.

FIELD OF THE INVENTION

The present invention relates to the field of high-availability, high-scale and high security computing for API computing and API ecosystems. In particular, the invention provides scalable proxy clusters, and methods for configuring proxy clusters and/or proxy nodes within the proxy clusters.

BACKGROUND

The use of a proxy as an intermediary between a client (i.e. a device requesting a service) and a server (i.e. a device providing the service) is known. Proxies can typically be used to implement several different networking functions, including any one or more of securing or capturing data samples of data traffic passing through such proxies, routing, load balancing and forwarding functions.

FIG. 1 illustrates a networking architecture comprising client 102, server backend 106 comprising servers 106 a to 106 c, proxy 104 and DNS server 108. Based on information retrieved from DNS server 108, requests or messages from client 102 for services from server backend 106 are directed to proxy 102. Proxy 102 thereafter transmits the received requests or messages to an appropriate server (106 a to 106 c) within server backend 106. Depending on the configuration of proxy 104, responses from servers 106 a to 106 c may first be received at proxy 102 and thereafter redirected to requesting client 102.

Proxy based configurations of the type illustrated in FIG. 1 have a finite processing capacity—which limits the number of clients and servers a proxy can simultaneously service. Additionally prior art configurations present limitations in terms of high availability—where “high availability” refers to the characteristic of a system to continue running and handling failures with minimum planned or unplanned down time.

There is accordingly a need for (i) a scalable cluster of proxies configured to route communications between clients and servers, without any single point of failure, (ii) efficient methods of configuring and scaling the cluster, (iii) natural resiliency of clusters, (iv) efficient scaling of such clusters, (v) configurability of such clusters to span multiple servers, multiple data racks and multiple data centers and (vi) to provide for switching between proxies in case of a proxy failure or between servers in case of failure of a server, rack or data center without loss of session information—thereby ensuring high availability and disaster recovery.

SUMMARY

The invention provides scalable proxy clusters, and methods for configuring proxy clusters and/or proxy nodes within the proxy clusters.

The invention provides a proxy node configured for implementation within a proxy cluster comprising a plurality of networked proxy nodes. The proxy node comprises (i) a processor, (ii) a proxy router configured to transmit received client message to one or more servers identified based on a specified routing policy, and (iii) a synchronization controller configured to respond to a defined synchronization event, by synchronizing one or more data states of the proxy node with corresponding one or more data states of at least one other proxy node within the plurality of proxy nodes.

The synchronization controller may be configured to respond to a defined synchronization event by synchronizing the one or more data states of the proxy node with corresponding one or more data states of every other proxy node within the plurality of proxy nodes.

The one or more data states of the proxy node or the corresponding one or more data states of the at least one other proxy node may comprise data states corresponding to any one or more of server characteristic data, session data, security data, configuration data and proxy node data associated with the respective proxy node.

The one the one or more data states of the proxy node or the corresponding one or more data states of the at least one other proxy node may comprise data states corresponding to server characteristic data, session data, security data, and configuration data associated with the respective proxy node.

In an embodiment of the invention, the proxy router may be configured such that routing functionality of the proxy node is identical to routing functionality of at least one other proxy node within the plurality of proxy nodes.

In an embodiment of the invention, the proxy node may be configured for self-learning one or more functional capabilities of one or more other proxy nodes within the plurality of proxy nodes—wherein said self-learning is based on the synchronizing one or more data states of the proxy node with corresponding one or more data states of at the one or more other proxy nodes within the plurality of proxy nodes.

The invention additionally provides a proxy cluster comprising a plurality of networked proxy nodes. At least one of the plurality of proxy nodes respectively comprises (i) a processor, (ii) a proxy router configured to transmit received client message to one or more servers identified based on a specified routing policy, and (iii) a synchronization controller configured to respond to a defined synchronization event, by synchronizing one or more data states of the proxy node with corresponding one or more data states of at least one other proxy node within the plurality of proxy nodes. The synchronization controller may be configured to respond to a defined synchronization event by synchronizing the one or more data states of the proxy node with corresponding one or more data states of every other proxy node within the plurality of proxy nodes.

One or more data states of the proxy node within the proxy cluster, or the corresponding one or more data states of the at least one other proxy node within the proxy cluster may comprise data states corresponding to any one or more of server characteristic data, session data, security data, configuration data and proxy node data associated with the respective proxy node.

The invention additionally provides a method of synchronizing data states between proxy nodes within a networked cluster of proxy nodes. The method comprises (i) detecting a synchronization event at a first proxy node within the cluster of proxy nodes, (ii) selecting a second proxy node from among the cluster of proxy nodes, and (iii) synchronizing one or more data states of the first proxy node with corresponding one or more data states of the second proxy node within the cluster of proxy nodes. Each proxy node within the cluster of proxy nodes may be configured to transmit received client message to one or more servers identified based on a specified routing policy.

In an embodiment of the method, the one or more data states of the first proxy node may be synchronized with one or more data states of every other proxy node within the cluster of proxy nodes. In a method embodiment, the one or more data states of the first proxy node or the corresponding one or more data states of the second proxy node may comprise data states corresponding to any one or more of server characteristic data, session data, security data, configuration data and proxy node data associated with the respective proxy node.

The invention additionally provides a method of adding a proxy node to a networked cluster of proxy nodes. The method comprises configuring a processor implemented first proxy node to (i) transmit received client message to one or more servers identified based on one or more routing policies, and (ii) respond to a defined synchronization event, by synchronizing one or more data states of the first proxy node with corresponding one or more data states of one or more proxy nodes within the cluster of proxy nodes. In an embodiment, the one or more data states of the first proxy node are synchronized with one or more data states of every proxy node within the cluster of proxy nodes. The one or more data states of the first proxy node or the corresponding one or more data states of the second proxy node may comprise data states corresponding to any one or more of server characteristic data, session data, security data, configuration data and proxy node data associated with the respective proxy node.

The invention additionally provides a method of modifying configuration of a proxy cluster comprising a plurality of networked proxy nodes, wherein each of the plurality of proxy nodes is configured to (i) transmit received client message to one or more servers identified based on a specified routing policy, and (ii) responsive to detection of a synchronization event, synchronize one or more data states of said proxy node with corresponding one or more data states of at least one other proxy node within the plurality of proxy nodes. The method comprises (i) receiving operator input identifying a modification to configuration of a first proxy node within the plurality of proxy nodes, (ii) responsive to the receive operator input, modifying the configuration of the first proxy node, and (iii) implementing a modification to configuration of a second proxy node within the plurality of proxy nodes, wherein said modification is effected by synchronization of one or more data states of said second proxy node with corresponding one or more data states of said first proxy, in response to detection of a synchronization event by the second proxy node.

The invention additionally provides a proxy cluster comprising a plurality of networked proxy nodes, wherein at least one of the plurality of proxy nodes respectively comprises (i) a processor, (ii) a proxy router configured to transmit received client message to one or more servers identified based on a specified routing policy, and (iii) a synchronization controller configured to respond to a defined synchronization event, by synchronizing one or more data states of the proxy node with corresponding one or more data states of at least one other proxy node within the plurality of proxy nodes. Additionally, the proxy cluster may be configured for one or more of high availability, disaster recovery, scalability and security of API computing use (i) within data centers, (ii) within private, public or hybrid clouds, (iii) across multiple datacenters, (iv) across private, public or hybrid clouds, and (v) across a combination of one or more datacenters and one or more clouds.

The invention additionally provides computer program products for implementing one or more of the above methods, the computer program product comprising a non-transitory computer usable medium having a computer readable program code embodied therein, said computer readable program code comprising instructions for implementing said one or more methods.

BRIEF DESCRIPTION OF THE ACCOMPANYING DRAWINGS

FIG. 1 illustrates a network configuration involving a prior art proxy interposed between a client device and a server backend.

FIGS. 2 and 3 illustrate exemplary scalable proxy clusters.

FIG. 4 illustrates a proxy node within a proxy cluster.

FIG. 5 illustrates a peer-to-peer network configuration of a proxy cluster.

FIG. 6 is a flowchart illustrating a method of synchronizing proxy nodes within a proxy cluster.

FIG. 7 illustrates a method of adding a new proxy node to a proxy cluster.

FIG. 8 illustrates an exemplary system in accordance with the present invention.

FIG. 9 illustrates an embodiment of a software architecture for an API Load Balancer, according to embodiments described herein.

DETAILED DESCRIPTION

The present invention provides a scalable cluster of proxies configured, which proxies may in various non limiting examples be configured for one or more of securing or capturing data samples of data traffic passing through such proxies, routing communications between one or more clients and one or more servers, load balancing and forwarding functions. The invention additionally provides for (i) a scalable cluster of proxies configured to route communications between clients and servers, without any single point of failure, (ii) efficient methods of configuring and scaling the cluster, (iii) natural resiliency of clusters and/or proxy nodes within a cluster, (iv) efficient scaling of clusters, (v) configurability of clusters to span multiple servers, multiple data racks and multiple data centers, (vi) switching between proxies in case of a proxy failure or between servers in case of failure of a server, rack or data center (for example in case of loss of power, internet, hardware or software errors etc.), without loss of session information, and (vii) responsive to server failure (for enabling reconnection of a client device with a backup server in the same datacenter or in a different datacenter—thereby ensuring high availability and disaster recovery.

For the purposes of the invention “client” shall mean any device having information processing and network communication capabilities. The types of clients may vary widely and include but are not limited to desktop computers, laptop computers or notebook computers, personal digital assistants, handheld computers, cellular phones, servers and Internet of Things (IOT) sensors or devices or gateways or servers.

For the purposes of the present invention, “proxy” or “proxy node” shall mean any device having information processing and network communication capabilities that may in various non limiting examples be configured for one or more of securing or capturing data samples of data traffic passing through such proxies, routing communications between one or more clients and one or more servers, load balancing and forwarding functions. The types of proxies may vary widely and include but are not limited to full proxies, half proxies, security proxies, IOT proxies or load balancing proxies.

For the purposes of the present invention, “proxy cluster” or “cluster of proxies” shall mean a plurality of proxy nodes. For the purposes of the present invention, proxy nodes within a proxy cluster may be understood as being interconnected in an overlay network.

For the purposes of the invention, “server” shall mean any device having information processing and network communication capabilities, and which is configured to provide one or more services to a requesting client, over a communication network. The types of servers may vary widely, and include but are not limited to API servers, Applications Servers, Microservices, web servers, FTP servers, IOT brokers or servers or gateways, message brokers, or service oriented architecture (SOA) servers.

For the purposes of the invention, “server backend” shall mean a set of one or more servers.

FIG. 2 illustrates an embodiment of the invention, wherein proxy cluster 204 is disposed as a network intermediate between clients 202 (i.e. clients 202 a, 202 b and 202 c) and server backend 206 (i.e. servers 206 a, 206 b and 206 c). Each client request or client message directed towards server backend 206, and each response from server backend 206 is routed through proxy cluster 204.

As illustrated in FIG. 2, proxy cluster 204 comprises a plurality of proxy nodes (proxy nodes 204 a, 204 b and 204 c). A client request or message for server backend 206 is routed to a proxy node within proxy cluster 204. Based on routing policies and/or other information available to the specific proxy node, said proxy node routes the client request to an appropriate server within server backend 206. Server responses to the client request or message are likewise transmitted from the server back to the specific proxy node, and onward from the specific proxy node to the client from which the request or message originated.

In the embodiment illustrated in FIG. 2 requests or messages from client 202 a are routed to proxy node 204 a within proxy cluster 204. Said requests or messages are thereafter routed to servers 206 a and 206 b within server backend 206. Likewise, a request from client 202 b is routed to proxy node 204 b and onward to server 206 c within server backend 206. A request from client 202 c is routed to proxy node 204 c and onward to server 206 b in server backend 206. Responses from the servers 206 a to 206 c are routed back to the corresponding requesting client through the same proxy node from which the request or message was received by the server.

The decision to route a client request or message to a specific proxy node within proxy cluster 204, may in an embodiment of the invention be based on routing logic or routing policies within DNS server 208 or other name server. Exemplary load balancing or connection parameters that the routing policies of DNS server 208 may rely on for selecting a specific proxy node within proxy cluster 204 may include one or more of location of the requesting client, location of the target server(s), existing load balance among proxies within the proxy clusters, content and/or type of request etc.

Selection of a target server (within server backend 206) by a proxy node within proxy cluster 204 may be determined based on routing logic or routing policies specified for the proxy node. In a specific embodiment of the invention, a plurality of proxy nodes within proxy cluster 204 (and preferably all proxy nodes within proxy cluster 204) may be configured to use identical routing policies for selecting a target server.

FIG. 3 illustrates another embodiment of the invention, wherein proxy cluster 304 is disposed as a network intermediate between clients 302 (i.e. clients 302 a, 302 b and 302 c) and server backend 306. In the illustrated embodiment, server backend 306 comprises server cluster 3061 (in turn comprising servers 3061 a and 3061 b) corresponding to a first data center (data center 1) and server cluster 3062 (in turn comprising servers 3062 a and 3062 b) corresponding to a second data center (data center 2).

As in the case of FIG. 2, a client request or message relating to services from server backend 306 are routed through a proxy node within proxy cluster 304. In the illustrated embodiment, proxy cluster 304 comprises a plurality of proxy nodes 304 a and 304 b. A client request or message for one or more services from server backend 306 are routed to a specific proxy node within proxy cluster 304. The specific proxy node routes the client request or message to an appropriate server within server backend 306 and server responses are likewise transmitted from the server back to the specific proxy node, and onward to the requesting client.

As would be observed from the illustration in FIG. 3, client requests for (or messages to) servers within data center 1 are routed to a first proxy node (i.e. proxy node 304 a), while client requests for (or messages to) servers within data center 2 are routed to a second proxy node (i.e. proxy node 304 b). The decision to route a client request to a specific proxy node based on location of a target server, may in an embodiment of the invention be based on routing logic or routing policies within DNS server 308. In an embodiment of the invention, proxy nodes (304 a, 304 b) within proxy cluster 304 may be configured such that in case of failure of a server located within a specific server rack or a specific data center, the proxy node receiving client requests or messages targeted at the failed server may instead route such requests or messages to a backup server/mirror server/peer server providing the same functionality as the failed server. The configuration of proxy nodes within proxy cluster 304 ensures that such re-routing may be effected regardless of whether the failed server and backup server/mirror server/peer server are located within the same server rack or the same data center, or are located across multiple server racks and/or across multiple data centers. In an embodiment of the invention, proxy cluster 304 may be configured to respond to server failure by re-routing client messages or requests to a backup server/mirror server/peer server despite simultaneous failure of the proxy node that was previously receiving client requests or messages targeted at the failed server. Proxy cluster 304 may achieve this by substituting an operational proxy node within the proxy cluster for the failed proxy node—and may in an embodiment (discussed in more detail hereinafter) implement such substitution of proxy nodes and consequent re-routing of client messages or requests to a backup server/mirror server/peer server without having to re-initialize a client or user session (i.e. without loss of session data).

As in the case of FIG. 2, selection of a target server (within server backend 306) by a proxy node within proxy cluster 304 may be determined based on routing logic or routing policies provisioned for the proxy node. In a preferred embodiment, a plurality of proxy nodes within proxy cluster 304 (and preferably all proxy nodes within proxy cluster 304) are provisioned with identical routing policies for selecting a target server.

FIG. 4 illustrates an embodiment of a proxy node configured for implementation within a scalable proxy cluster of the present invention. Proxy node 400 comprises a proxy router 402 and a synchronization controller 404. Proxy node 400 may additionally include or enable access to one or more repositories of data associated with proxy node 400, said repositories of data comprising (i) server characteristic data 406, (ii) session data 408, (iii) security data 410, (iv) configuration data 412, and (v) proxy node data 414. One or more repositories comprising the above data may in various embodiment of the invention be accessible by one or both of proxy router 402 and synchronization controller 404.

Server characteristic data 406 comprises information identifying one or more characteristics of one or more servers within the server backend i.e. information that is descriptive of configuration, interfaces, and/or functionality of one or more servers within the server backend to which a proxy node is configured to route client requests or messages. In an embodiment of the invention, server characteristic data 406 includes one or more of (i) network sockets corresponding to servers, (ii) TCP, HTTP/WebSocket, Request/Response, streaming and/or Publish/Subscribe message patterns for accessing servers (iii) business logic execution engine(s) implemented within servers (iv) backend connectivity between a server and other servers, (v) applications implemented on servers, and/or (vi) database systems relied on or implemented within servers.

Session data 408 comprises information identifying one or more characteristics of users/clients communicating through a proxy node. In an embodiment of the invention, session data 408 comprises one or more of (i) cookies, (ii) tokens, (iii) client ids and/or (iv) device ids. In a more specific embodiment of the invention, session data 408 may be limited to information that is active (i.e. that has not expired) in accordance with session expiry policies of one or more servers within the server backend to which a proxy node is configured to route client requests or messages.

Security data 410 comprises Transport Layer Security/Secure Sockets Layer (TLS/SSL) security data corresponding to each session that is active (i.e. that has not expired) in accordance with applicable session expiry policies. In an embodiment of the invention, security data 410 may comprise one or more of cipher suites, digital certificates (including one or more of server name, a trusted certificate authority (CA) and a backend server's public encryption key), session keys and/or asymmetric and symmetric ciphers that have been received at proxy node 400.

Configuration data 412 comprises configuration information that a proxy node requires to effect routing of incoming client requests or messages to a server within the server backend in one or more data centers. In an embodiment of the invention, configuration data 412 may comprise one or more of (i) data port information and/or other routing information corresponding to one or more servers within a server backend, (ii) load balancing or routing policies, (iii) load balancing and/or routing techniques (iv) management ports, (v) maximum number of processes/threads for each port, (vi) policies for generating logs (i.e. policies regarding what events or information to log, event triggers for logging and log persistence and/or management policies) and/or (vii) firewall settings corresponding to one or more servers within the server backend.

Proxy node data 414 comprises information identifying live or active proxy nodes (other than proxy node 400) within the proxy cluster. In an embodiment of the invention proxy node data 414 may comprise one or more of hardware identification information, IP address information and/or network routing information corresponding to such live or active proxy nodes within the proxy cluster.

Proxy router 402 comprises a processor based controller that is configured to (i) receive client requests or client messages, and (ii) responsive to received requests or messages satisfying one or more predefined criteria, transmitting said requests or messages onward to one or more server(s) within server backend 206, 306. Proxy router 402 is a controller configured to implement predefined routing logic or routing policies on client requests or messages received at a proxy node—to ensure that legitimate client requests or messages are transmitted onwards to a server configured to respond to such requests or messages. In an embodiment of the invention, in implementing onward transmission of received client requests or messages to one or more servers, proxy router 402 may rely on one or more of server characteristic data 406, session data 408, security data 410 and configuration data 412 that is associated with and accessible to proxy node 400.

Synchronization controller 404 comprises a processor based controller that is configured to respond to a predefined synchronization event or synchronization trigger by synchronizing (i) a data state of one or more of server characteristic data 406, session data 408, security data 410, configuration data 412 and proxy node data 414 that is associated with said proxy node, with (ii) a data state of corresponding server characteristic data, session data, security data, configuration data and/or proxy node data associated with another proxy node within proxy cluster 204, 304. In an embodiment of the invention, synchronization controller 404 is configured to synchronize data states of one or more (and preferably all) of server characteristic data 406, session data 408, security data 410, configuration data 412 and proxy node data 414 associated with said proxy node, with (ii) data states of corresponding server characteristic data, session data, security data, configuration data and proxy node data associated with every other proxy node within proxy cluster 204, 304. It would be understood that in an embodiment of the invention, synchronization of data states may involve synchronization of state changes that have occurred since a previous synchronization event. In an embodiment of the invention, synchronization controller 404 may be configured to establish distinct read and write connections with each proxy node that it synchronizes with. In an embodiment, the distinct read and write connections with each proxy node that a synchronization controller 404 synchronizes with, may be implemented by initializing separate read and write pipe endpoints for each such proxy node.

In a preferred embodiment of the invention, every proxy node within proxy cluster 204, 304 may comprise an instance of proxy node 400. Since synchronization controller 404 of each proxy node within the cluster is configured to ensure synchronization of the above mentioned proxy node data states with corresponding data states of every other proxy node within the cluster, the synchronization process results in all proxy nodes within the cluster having an identical data state corresponding to one or more (and preferably all) of server characteristic data, session data, security data, configuration data and proxy node data.

As discussed above, in implementing onward transmission of received client requests or messages to one or more servers, proxy router 402 within each proxy node 400 may rely on one or more of server characteristic data, session data, security data and configuration data that is associated with or accessible to proxy node 400. By configuring synchronization controller 404 within each proxy node 400 to ensure synchronization of each set of data that proxy router 402 relies on for implementing routing/onward transmission functionality, proxy cluster 204, 304 can be configured for self-learning, wherein any proxy node within the proxy cluster achieves the necessary functionality required by all proxy nodes in the proxy cluster, without requiring configuration by an operator or administrator. This ensures that in the case of a proxy node failure, upon resumption of service the failed proxy node synchronizes with other proxy nodes and is ready to resume operations without further loss of time and without further configuration. In an embodiment, the synchronization between proxy nodes additionally ensures that every proxy node within said proxy cluster performs routing/onward transmission functions identically (i.e. a specific client request or message will undergo identical routing/onward transmission by a recipient proxy node, regardless of which proxy node (within the proxy cluster) the client request or message is received at).

FIG. 5 illustrates the embodiment of the invention where proxy cluster 504 comprises four proxy nodes 500 a to 500 d. While FIG. 5 illustrates a proxy cluster comprising only four proxy nodes, it will be understood that the four proxy nodes are only illustrative and that the proxy cluster may be scaled up or down to include any number of proxy nodes. Each proxy node 500 a to 500 d within proxy cluster 504 may comprise an instance of proxy node 400. Accordingly, the synchronization process between proxy nodes 500 a to 500 d may result in all proxy nodes within the cluster having identical data states corresponding to one or more (and preferably all) of server characteristic data 406, session data 408, security data 410, configuration data 412 and proxy node data 414. As illustrated in FIG. 5, the synchronization of data states between proxy nodes results in a peer-to-peer synchronization configuration within proxy cluster 504—wherein each proxy node 500 a to 500 d is a peer node within the peer-to-peer synchronization configuration.

In an embodiment of the invention, each proxy node within a proxy cluster periodically carries out a heartbeat messaging procedure (i.e. a ping-pong message/response procedure) with all other proxy nodes and updates its list of active peer nodes (i.e. proxy node data 414) depending on whether the heartbeat messaging procedure returns an error.

FIG. 6 illustrates a method of achieving peer-to-peer synchronization between proxy nodes within a proxy cluster. For the purposes of FIG. 6, it will be understood that each peer node is a proxy node 400 of the type described in connection with FIG. 4. Further, the method of peer-to-peer synchronization in FIG. 6 is described in terms of achieving data state synchronization between a proxy node and all peer proxy nodes (i.e. all other proxy nodes) within the proxy cluster.

Step 602 of FIG. 6 comprises detection of a predefined synchronization trigger event at a proxy node. The synchronization trigger event may comprise any predefined event based trigger—and in an embodiment may comprise a time based event trigger. In an embodiment of the invention, the synchronization trigger event may comprise a trigger instruction generated at a proxy node upon expiry of a predefined time period from the last trigger instruction. In an embodiment of the invention, the synchronization trigger event may comprise a trigger instruction generated when a proxy node is bootstrapped into a proxy cluster, or when a proxy node resumes operations within a proxy cluster subsequent to recovery from a state of failure.

At step 604, responsive to detection of a trigger event at step 602, the proxy node retrieves information identifying peer nodes within the proxy cluster. In an embodiment of the invention, information identifying peer nodes within the proxy cluster may be retrieved from proxy node data 414 associated with proxy node 400.

Step 606 comprises selecting a peer node from among the identified peer nodes. Step 608 thereafter comprises initiating data synchronization at the proxy node—to achieve synchronization of (i) a data state of one or more of server characteristic data, session data, security data, configuration data and proxy node data associated with the proxy node, with (ii) a data state of corresponding server characteristic data, session data, security data, configuration data and proxy node data associated with the selected peer node. In an embodiment of the invention, initiating data synchronization at a proxy node comprises establishing distinct read and write connections with every other proxy node that said proxy node synchronizes with. In an embodiment, the distinct read and write connections with every other proxy node that the proxy node synchronizes with, may be implemented by initializing separate read and write pipe endpoints for every such other proxy node.

Step 610 comprises repeating steps 606 and 608 to achieve synchronization of data states (corresponding to the selected data parameters) within the proxy node with corresponding data states within every other identified peer node in the proxy cluster.

By implementing method steps of FIG. 6 across all peer proxy nodes within a proxy cluster, the method ensures that all proxy nodes within the proxy cluster have synchronized data states corresponding to one or more of server characteristic data, session data, security data, configuration data and proxy node data. By appropriately selecting parameters for synchronization of data states across proxy nodes, the method of FIG. 6 can ensure that every proxy node within the proxy cluster performs routing/onward transmission functions identically.

FIG. 7 illustrates an exemplary method for bootstrapping a new processing node and adding the new processing node as a proxy node within an existing proxy cluster. It would be understood that the method of FIG. 7 may be used for scaling the proxy cluster up in response to an increased demand for proxy nodes within the cluster.

Step 702 comprises configuring the new processing node for operating as a proxy node within a proxy cluster. Configuring the new processing node may comprise configuring one or more processors associated with the new processing node to implement functions of proxy router 402 and synchronization controller 404 that have been illustrated and described in connection with FIG. 4. In an embodiment, configuring the new processing node may comprise providing program instructions that enable the one or more processors associated with the new processing node to implement one or both of proxy router functionality and synchronization controller functionality.

Step 704 comprises provisioning the new processing node with an identifier of at least one existing peer proxy node within the proxy cluster. In an embodiment of the method, the identifier information provisioned at step 704 may comprise an IP address (or any other information described previously in connection with proxy node data 414 of FIG. 4) corresponding to at least one live or active peer node within the proxy cluster.

Step 706 comprises bootstrapping the new node into the overlay network formed by peer nodes in the proxy cluster, and initiating at the new node, a data state synchronization with at least one peer node of which the new node is aware. The data state synchronization between the new node and the peer node may in a preferred embodiment involve synchronization of data states of the new node with data states of the one peer node—in respect of one or more (and preferably all) of server characteristic data, session data, security data, configuration data and proxy node data associated with said proxy node.

In the process of data state synchronization either (i) the new node receives information regarding all other peer nodes identified within the proxy node data 414 corresponding to the peer node, or (ii) the peer node broadcasts address information corresponding to the new node to every other peer node identified within proxy node data 414 corresponding to said peer node—both of which methods (when coupled with the step of data synchronization between all proxy nodes within the proxy cluster) result in data state synchronization between the new node and all other peer nodes within the proxy cluster. By implementation of data state synchronization between the new node and every peer node within the proxy cluster, the new node achieves data state synchronization with all pre-existing peer nodes within the proxy cluster—thereby achieving the status of a fully synchronized peer node within the proxy cluster.

While not specifically illustrated, it would also be understood that each pre-existing peer node within the proxy cluster updates its proxy node data 414 to include received identifier or address information corresponding to each new peer node, thereby adding to its own list of active peer nodes with which data state synchronization requires to be implemented in response to the next synchronization event or synchronization trigger.

In a preferred embodiment, bootstrapping the new node into the proxy cluster may additionally include adding or modifying (automatically or manually) one or more DNS server entries corresponding to one or more servers within the server backend that is serviced by the proxy cluster, wherein the added or modified DNS server entries comprises address information corresponding to the new node, and may also include data that is determinative of routing policies that may be applied by the DNS server for routing client requests or messages to the new node.

It would likewise be understood that one or more nodes may be removed from the proxy cluster to scale the proxy cluster down in response to a decreased demand for proxy nodes. In one embodiment, this process may comprise removal of a peer node from network communication with the remaining peer nodes, and updating proxy node data in at least one of the remaining peer nodes—which updating comprises removal of the removed peer node from the list of active peer nodes. By virtue of periodic data synchronization, this updating of proxy node data will be communicated to (and updated at) all other peer nodes within the proxy network in response to the next synchronization event or synchronization trigger. In another embodiment, failure of any peer node to response to a predefined number of heartbeat messages (ping messages) may result in the peer node being marked as inactive and/or deleted from proxy node data 414 within each proxy node Likewise DNS server entries corresponding to a removed peer node may be automatically modified to reflect the inactive/removed status in response to one or more error messages received in response to requests or messages routed to the removed peer node. In another embodiment, DNS server entries may be updated by a proxy node within the proxy cluster (or by the proxy cluster) in response to a new proxy node being added to the proxy cluster or a previously operational proxy node being removed from the proxy cluster. In case of removal or failure of a previously operational proxy node, data traffic workload previously being handled by the removed or failed node will be handled by one or more of the remaining proxy nodes within the proxy cluster—until the removed or failed proxy node resumes operations, at which time it synchronizes with one or more of the remaining proxy nodes in the proxy cluster and resumes operations as before.

In an embodiment of the invention, the proxy cluster may be accessible through a command-line-interpreter (CLI) or a RESTful API (REST API) which permits configuration inputs and/or configuration modifications at any one or more proxy nodes within the proxy cluster. In an embodiment of the invention, any additions or modifications to any one of server characteristic data, session data, security data, configuration data and proxy node data associated with proxy nodes within the proxy cluster, may be implemented at any one of the proxy nodes through the CLI or REST API—which additions or modifications will be automatically communicated to and implemented within all other proxy nodes within the proxy cluster by way of data state synchronizations implemented by each proxy node within the proxy cluster. It would accordingly be understood that implementation of data state synchronizations between proxy nodes within a proxy cluster presents significant advantages in terms of ease of administration and configuration of nodes within the proxy cluster.

In an embodiment of the invention, the proxy cluster can support multiple communication and/or messaging protocols, both at the node level and at the cluster level. Exemplary non-limiting examples of protocols that can be supported at the node level or at the cluster level within the proxy cluster include (i) HTTP/HTTPS, (ii) WebSocket/WebSocket over SSL, (iii) MQTT/WebSocket, (iv) MQTT/WebSocket over SSL, (v) MQTT/TCP, (vi) MQTT/TLS, and (vii) CoAP.

Based on the above, it would be understood that proxy clusters in accordance with the teachings of the present invention offer multiple advantages over solutions known in the prior art.

A primary advantage comprises scalability or elasticity in terms of (i) a number of users/clients that can be served, (ii) type of client devices that can be served within the same or across different data centers, (iii) a number of servers within the server backend that can be served, and/or (iv) a number of protocols that can be served or accommodated.

The invention also enables proxy nodes to be added to or removed from a proxy cluster with minimal configuration overheads. By enabling ready upward and downward scaling, the invention presents significant cost advantages over the prior art. The invention further avoids dependence on a master-slave or a centralized control model—which makes proxy clusters of the present invention simpler to configure as well as to scale.

In addition to the above, proxy clusters of the present invention avoid any single point of failure—by providing alternative proxy nodes for routing of client requests or messages in case of failure of one or more proxy nodes. This natural resiliency ensures high availability. Additionally, assuming a specific proxy node fails but later comes back online, the data state synchronization process ensures that the revived proxy node is synchronized with all other peer proxy nodes in the course of the next synchronization event—and that such proxy node can therefore resume full functionality upon conclusion of the next synchronization event.

The resilience and data state synchronization aspects of the proxy cluster also enable real time and “live” reconfiguration or updates of the proxies to reflect changes to APIs or API servers within the server backend or to provide new routing instructions. In the process of reconfiguration or updating of one or more servers, corresponding server characteristic data 408 within one of the proxy nodes 400 is updated. While such updating of information occurs at the proxy node 400, the proxy node is unable to continue providing functionality as a proxy node within the proxy cluster—during which period the remaining proxy nodes in the proxy cluster continue to route information between clients and servers and thereby ensure high availability. Once the updating of information is complete, the updated proxy node is rebooted and brought back online within the proxy cluster. The updated information now available at the updated proxy node may be propagated to all other proxy nodes within the proxy cluster during the next data state synchronization event—which ensures updating of information throughout the proxy cluster, without the proxy cluster suffering a system failure or outage at any point of time. It would be understood that functioning of the proxy cluster, and transmission of client requests or messages between clients and the server backend can continue without observable disruption while the above described server updates/reconfiguration, proxy node updating and proxy node synchronization is implemented.

Since in certain embodiments, synchronization of data states includes synchronization of session data 408 and security data 410, a user or client device may be shifted from one proxy node within a proxy cluster to another proxy node within the cluster (for example, in response to failure of a proxy node, or in response to a shift due to a change in state of a user or client device) without having to reinitialize a user session or client session.

In an embodiment of the invention, the proxy cluster is a non-symmetric cluster, in that at least two proxy nodes within the proxy cluster have different operating specifications in terms of one or more of hardware, operating systems and/or application software.

It would be understood that proxy clusters in accordance with the teachings of the present invention can be configured to span one or more server backends that are implemented across a plurality of server racks and/or a plurality of datacenters. In addition to enabling scalability of the proxy cluster, the invention enables geography specific scaling of proxy clusters based on local demand, as well as setting up of cross-border datacenters. The data state synchronizations of the invention additionally ensure persistence of data and metadata across all proxy nodes within a proxy cluster—which data and metadata may span a server backend that has been set up across multiple datacenters. Persistence of data and metadata across data centers has been found to present significant advantages over the earlier state of art—particularly in view that synchronizing session state data ensures true session recovery (for high availability and/or disaster recovery) by servers in a second datacenter in case of failure of servers within a first datacenter—thereby ensuring that an ongoing session and/or business logic within an ongoing session can be continued without having to re-initialize the session.

The proxy cluster may be configured to span at least two data centers, wherein the secondary data center (and servers therein) is configured to provide disaster recovery support for the primary data center. In an embodiment, the proxy cluster is configured such that in case of a disaster event causing entire failure of the primary data center (including one or more proxy nodes corresponding to said primary data center), the DNS server (or other name server) is reconfigured (automatically or manually) to route client requests or messages to the secondary data center without causing disruption to the end user(s). In another embodiment, the proxy cluster is configured such that in the event the primary data center (or servers therein) fails without a corresponding failure of proxy nodes servicing said primary data center, client requests or messages corresponding to client sessions that were previously being serviced by the primary data center are subsequently routed to one or more servers within the secondary data center.

FIG. 8 illustrates an exemplary system in which various embodiments of the invention, including one or more proxy nodes within a proxy cluster, may be implemented.

The system 802 comprises at-least one processor 804 and at-least one memory 806. The processor 804 executes program instructions and may be a real processor. The processor 804 may also be a virtual processor. The computer system 802 is not intended to suggest any limitation as to scope of use or functionality of described embodiments. For example, the computer system 802 may include, but not limited to, one or more of a general-purpose computer, a programmed microprocessor, a micro-controller, an integrated circuit, and other devices or arrangements of devices that are capable of implementing the steps that constitute the method of the present invention. In an embodiment of the present invention, the memory 806 may store software for implementing various embodiments of the present invention. The computer system 802 may have additional components. For example, the computer system 802 includes one or more communication channels 808, one or more input devices 810, one or more output devices 812, and storage 814. An interconnection mechanism (not shown) such as a bus, controller, or network, interconnects the components of the computer system 802. In various embodiments of the present invention, operating system software (not shown) provides an operating environment for various softwares executing in the computer system 802, and manages different functionalities of the components of the computer system 802.

The communication channel(s) 808 allow communication over a communication medium to various other computing entities. The communication medium provides information such as program instructions, or other data in a communication media. The communication media includes, but not limited to, wired or wireless methodologies implemented with an electrical, optical, RF, infrared, acoustic, microwave, bluetooth or other transmission media.

The input device(s) 810 may include, but not limited to, a touch screen, a keyboard, mouse, pen, joystick, trackball, a voice device, a scanning device, or any another device that is capable of providing input to the computer system 802. In an embodiment of the present invention, the input device(s) 810 may be a sound card or similar device that accepts audio input in analog or digital form. The output device(s) 812 may include, but not limited to, a user interface on CRT or LCD, printer, speaker, CD/DVD writer, or any other device that provides output from the computer system 802.

The storage 814 may include, but not limited to, magnetic disks, magnetic tapes, CD-ROMs, CD-RWs, DVDs, any types of computer memory, magnetic stripes, smart cards, printed barcodes or any other transitory or non-transitory medium which can be used to store information and can be accessed by the computer system 802. In various embodiments of the present invention, the storage 814 contains program instructions for implementing the described embodiments.

While not illustrated in FIG. 8, the system of FIG. 8 may further include some or all of the components of a proxy node of the type more fully described in connection with FIG. 4 herein above.

The present invention may be implemented in numerous ways including as a system, a method, or a computer program product such as a computer readable storage medium or a computer network wherein programming instructions are communicated from a remote location.

The present invention may suitably be embodied as a computer program product for use with the computer system 802. The method described herein is typically implemented as a computer program product, comprising a set of program instructions which is executed by the computer system 802 or any other similar device. The set of program instructions may be a series of computer readable codes stored on a tangible medium, such as a computer readable storage medium (storage 814), for example, diskette, CD-ROM, ROM, flash drives or hard disk, or transmittable to the computer system 802, via a modem or other interface device, over either a tangible medium, including but not limited to optical or analogue communications channel(s) 808. The implementation of the invention as a computer program product may be in an intangible form using wireless techniques, including but not limited to microwave, infrared, bluetooth or other transmission techniques. These instructions can be preloaded into a system or recorded on a storage medium such as a CD-ROM, or made available for downloading over a network such as the Internet or a mobile telephone network. The series of computer readable instructions may embody all or part of the functionality previously described herein.

While the exemplary embodiments of the present invention are described and illustrated herein, it will be appreciated that they are merely illustrative. It will be understood by those skilled in the art that various modifications in form and detail may be made therein without departing from or offending the spirit and scope of the invention as defined by the appended claims.

Embodiments of the invention relate to the field of Layer 7 software networking using any protocol, including HTTP and WebSocket protocols, with and without encryption such as TLS/SSL, and relates to the field of APIS and other programmatic interfaces.

The current state of the art is a single HTTP protocol server level load balancing at Layer 7 with no Application or API focus/relationships.

The objective of embodiments of the present invention is to scale, track and secure all types of API's in the cloud datacenter using software defined networking with elasticity for mobile apps, HTML5, web apps, IoT devices, etc.

Embodiments of the present invention are directed at API load balancing with API connection/access tracking and firewall capabilities for services running in cloud datacenters.

FIG. 9 illustrates one embodiment of a software architecture for an API Load Balancer with API access tracking and Firewall capabilities. The unique architecture blends Layer 7 networking protocols with API capabilities and provides a single integrated solution for load balancing, tracking access, and securing APIs.

In one embodiment, API Load Balancer is an Event Driven Architecture with dual data plane for text and encrypted protocols with a shared single control plane. This architecture allows the handling of complex API requests from many different types of clients (HTML5, Mobile, IoT, etc.) and uses less computing and memory resources for scale and security functions—while delivering improved performance.

A dual data plane with multiple processes allows handling large number of TCP/SSL connections using both our event driven implementation and the availability of underlying file descriptors from the operating system. The separation of data planes for text and encrypted protocols allows setting different policies for security and increases the protection of backend APIs. The modular protocol engine inside the data plane (balancer) allows adding more (in addition to HTTP and WebSocket) Layer 7 protocols based on market needs— MQTT is one of them.

In one embodiment, the control plane is responsible for the entire life cycle of the API load balancer including launching balancer processes, clustering, configuring, auto-scaling, and exposing internal CLI/REST APIs for management. The controller is native to the multiple balancers functionality and uses IPC mechanisms (Unix Local Socket) for communication. This integrated model removes external single points of failure and allows implementers/DevOps to scale API load balancers easily as a single unit.

The API load balancer is native C++ code running in the Linux OS' user space. It also supports cookie persistence by means of a permanent storage device such as an SSD drive using a NoSQL database. The cluster supports mirroring of configurations and cookies across all instances of API Load balancers. An administrator/DevOps focused CLI is exposed for management. Internal REST APIs allows backend API servers to add, update, modify, and delete server pool in the configuration for auto scaling.

The application centric infrastructure inside datacenters is going through a major transition from static, hardware-driven to dynamic, software-driven infrastructures due to 1) the high growth of devices and apps getting deployed and connected to the internet and 2) the efficiency that the software-only model brings to IT organizations.

A second significant change is the exploding use of APIs as APIs allow data and compute services to be reused, shared, and monetized enabling organizations to innovate faster. Developers and DevOps rely more than ever on API's to speed up the delivery of new applications by using disparate pieces of existing application code as building blocks for their offering.

In this modern Cloud-first and API-first software-defined datacenter, there is a need for elastic software solutions to handle multiple Layer 7 protocols for load balancing, scaling, securing, firewalling, and tracking all APIs accessed.

The API Load Balancer, as illustrated in FIG. 9, may include any combination of the following embodiments in the software architecture. The following is the list and description of each of the embodiments:

1. Multi-Protocol Cookie-based and Token-based Stickiness for both HTTP and WebSocket Protocols

The “cookie” in web browsers and the “token” in mobile apps are primary identity technologies for connection authentication and authorization. Modern web services applications expose thousands of API's with multiple protocols for communicating with apps. One embodiment discussed herein dynamically learns the cookie or token from the API response and applies the same cookie or token when the client's request comes back in either the HTTP or WebSocket connection and then routes the request automatically and dynamically to the correct back-end API server. This embodiment uses a unique combination of HTTP header and underlying TCP connection keep-alive techniques for providing cookie-based stickiness for both HTTP and WebSocket. An engine of the present embodiment learns the set-cookie or token from the API response, remembers it as being part of the underlying TCP connection that is being kept alive by our engine, and if the client upgrades the connection to WebSocket, the engine uses the same TCP connection to connect back to the same API server which authenticated the initial request—creating cookie or token stickiness as a result for the end-to-end session.

2. Separation of Control and Data Planes—with Dual Data Planes

One embodiment discussed herein simplifies the management of multiple connection types and end points. The separation of concern enables the creation of a single layer 7 control functions to support multiple data planes, also called balancers The embodiment defines how a single controller creates 2 separate master processes—one for port 80 and one for port 443. Each master process creates its own set of child processes to handle all connections associated with its port. The controller identifies all port 443 connections automatically to initiate SSL certificate operations. This embodiment is achieved by using a modular configuration for each back-end API being connected to and using a bi-directional IPC mechanism to enable communications between controller and balancers.

3. Multiple Processes for a Single Port for Each Data Plane

One embodiment enables the support of a greater number of TCP/SSL connections than is possible today. Today a single process is attached to each TCP port in a Unix/Linux server. This embodiment enables the load balancer to attach multiple processes to a single TCP port on a Unix/Linux server in order to handle a large number of TCP/SSL connections. A master balancer process is created by the controller which also tells the master how many child processes can be created. An IPC mechanism and a shared memory between the master and the various child processes are used to share a number of parameters such as configurations information, cookies, tokens, routing logic etc. This allows multiple child processes to be connected to the same TCP port. This technique is dynamic and optimizes the underlying CPU, Memory, and IO available from the server. This technique can be referred to as vertical scaling in Layer 7 network routing.

4. API Level Load Balancing for REST and WebSocket API's

Existing server load balancers use 2 parameters (IP address and Port number) for load balancing all requests to back-end web and app servers. One embodiment uses 3 parameters (IP address, Port number and API name) for load balancing requests directly to the back-end APIs being used by the client application. This embodiment (which can be considered a 3 parameter method) allows accessing APIs directly as opposed to having the TCP connection made first with the server hosting the APIs and then relying on the server to create a path to the API as is the case with the 2 parameter load balancing method. The embodiment also allows separating different types of APIs, such as REST APIs and WebSocket APIs, and setting different policies for polling vs streaming of data. The client TCP connection is inspected at the URL level and the path name is decoded to understand the API name associated with the host name. The API information is then matched by our balancer using our own configuration mapping file to correctly identify the back-end API server. The server TCP connection is then established at the API level and packets are subsequently transferred directly to the API.

5. JSON-based API Mapping and Configuration File

One embodiment, as discussed herein, is to simplify and automate the configuration for the balancer using JSON format with API details as oppose to server details, The following is the format of this embodiment:

{ “api_metadata”: { “protocol”: “ws”, “url”: “/websocket_echo/*” “cookie”: “JSESSIONID”, “timeout”: “20”, “logout_api_enabled”: “true”, “health_check_interval”: “60”, “health_retry_count”: “4”, “health_request_path”: “/websocket_echo/websocket/health”, “health_retry”: “true”, “health_ping_string”: “ping”, “health_pong_string”: “pong”, “cookie_persistence_enabled”: “true”, “memory_size”: “128MB”, “servers”: [ { “ip”: “127.0.0.1”, “port”: “8080” }, { “ip”: “127.0.0.1”, “port”: “8090” } ] }  }

6. Distributed Cluster Across Datacenters

One embodiment allows a client app to transparently connect to multiple servers, racks or datacenters regardless of physical location or Cloud type. This embodiment uses the Elected Coordinator and Node concepts combined with persistence to cluster multiple balancers and associate them to a single back-end API. A cluster of balancers can span multiple servers, racks or datacenters—including both private and public Clouds (also called hybrid environments) such as Amazon AWS or Microsoft Azure for example. By sharing our own API mapping and configuration files across all balancers inside a cluster, we allow a client connection to transparently move dynamically from one server to another—even if the other server is in a different Cloud or datacenter. With this embodiment the management of clusters is achieved by notifying the coordinator and having the coordinator update all nodes automatically—for any change. Each node maintains a separate list and is self-contained with all configuration information. This allows each node to serve client requests across servers, racks or datacenters. Each node can physically be deployed in any datacenter.

7. Live Updates of API Servers for Auto Scale

The business continuity of any web application is based on the ability to add, modify and delete back-end APIs without affecting the continuity of the service itself. With existing implementations, changes to back-end API servers force load balancers to be taken out of service so that they can be updated before being put back on line. One embodiment allows for the dynamic update of load balancers configurations without affecting the services supported by APIs and web applications. They are not taken out of service in order to capture the latest back-end service API changes. This embodiment has multiple processes for each port and maintains a shared map for API server pool configuration information. The live update of load balancer configuration information to reflect the changes made to an API server is achieved by dynamically updating the shared map and having each balancer process receive notification from the map update process.

8. Cookie and Token Persistence

Modern web applications use long-lived session persistence including no expiration of cookies or tokens in order to maintain a simpler user experience, including simpler or no login, etc. One embodiment is about sharing cookies and tokens across multiple balancer processes in a single server by using a shared light weight NoSQL database and periodically saving them to an available permanent storage such as an SSD using a background thread without disrupting the client request. Each balancer has the ability to dynamically read back cookies and tokens to memory when needed to serve client requests.

9. Session Mirroring and Correlation

Embodiments of this technique can be referred to as session monitoring across servers, racks or datacenters.

Session mirroring enables end users to access a backend application or service and maintain the same session when switching client platforms—from a mobile phone to a PC for example—or/and switching location, for seamless session continuity. As an example a user on a mobile phone accessing a service in San Francisco, connected to the local datacenter, will maintain his session when switching to a PC after arriving in Los Angeles and now being connected to that local datacenter.

This embodiment uses embodiment #6 (Distributed Clusters across Datacenters) together with embodiment #8 (Cookie and Token Persistence) and the following additional technique. The technique involves maintaining synchronization across all the NoSQL databases used by the balancers to maintain session information, cookies, and tokens. Synchronization is achieved by using a process to monitor each shared NoSQL database for changes and then broadcasting using a real-time broadcasting algorithm these updates to all other NoSQL databases so that the updates will be captured and saved by each one of them. This results in all NoSQL database containing the same information.

This embodiment also exposes an internal API to be used by backend API servers or application servers to access the NoSQL database in order to retrieve the multiple cookies/tokens created for the same user when using different devices (mobile, desktop) to dynamically synchronize sessions. This technique can be referred to as session correlation.

Another embodiment can include the ability to check the expiration of cookies and tokens based on both the timestamp and the API logout information in order to remove them from the database. This embodiment blocks expired connections from possibly going to backend APIs again in order to improve performance and security.

10. API Firewall

API security is critical for data protection. Today there are major attacks taking place around the world to access government or business data and applications. Given that APIs are the interface to access these data and applications, proper defenses are critical.

1) External Name/ID Mapping. One embodiment protects APIs, and any programmatic interface, by mapping a different name, or ID, externally than the one used for the API or the programmatic interface itself. That “external facing” name or ID is the one used with the outside world instead of its given internal name or ID. The client TCP connection uses that external name/ID to access the API or programmatic interface. That connection is terminated by our balancer which then uses our internally configured policy map to identify the right internal API or programmatic interface name to use in order to establish server TCP connections for packet transfer.

The mapping is created using policy based configurations that uniquely map each API name to an external facing newly created name or ID. Management interface enable administrators to modify names and naming methods.

With this embodiment all other API names configured in an API server are now invisible to the client connection since they are not available to the request, unlike existing implementations that make all API names visible to the connections therefore making them prone to attacks.

2) Method Calls Enforcement. Another embodiment is the enforcement of API consumption policies to guarantee that the utilization of an API is in line with its intended usage. This is to prevent hackers from using certain APIs to illegally access data and applications. Our consumption policy enforcement invention checks all incoming API requests at the fine-grained HTTP method levels: GET, POST, DELETE, etc. and validates that the method used corresponds to the type of calls allowed by the API. It will automatically log and can terminate the connections if it's not an allowed method call for the backend API servers. For each API, a specific set of methods is specified in a configuration file when the API is created. When a client request reaches the balancer, the method call being used by the client will be compared against the allowed methods specified in the configuration file to determine whether the request is valid. A variety of actions can then be taken automatically to for example, but not restricted to, terminate, log, and audit trail the connection.

The embodiments covering the enforcement of allowed API method call is equally extended to all APIs and application interfaces used in layer 7 protocols such as for example, but not restricted to SOAP, SIP, HTTP/2, WebSocket/2. This allows our engine to guarantee that a call placed to access data or an application is in line with the intended use of that programmatic interface regardless of the interface calls used.

This embodiment also covers the enforcement of the API used at the layer 7 protocol level for example the specific use of a REST API versus a WebSocket API or other call.

3) Content Type Inspection and Enforcement. This embodiment is about enforcing policies established regarding the use of content type associated with a specific API call. For each API, a specific content type—for either text or binary content—is specified in a configuration file when the API is created. When a client request reaches the balancer, the content type being used by the client will be compared against the allowed content type specified in the configuration file to determine whether the request is valid. A variety of actions can then be taken automatically to for example, but not restricted to, terminate, log, and audit trail the connection.

The content type inspection covers both text and binary content types.

The embodiment covering the enforcement of allowed content type is equally extended to all programmatic and application interfaces used in layer 7 protocols such as for example, but not restricted to, SOAP, SIP, HTTP/2, WebSocket/2. This allows our engine to guarantee that a call placed to access data or an application is in line with the intended use of that programmatic interface regardless of the interface calls used.

11. API Connection Tracking

One embodiment tracks connections end-to-end, capturing the information from both ends of the connection: a) the client app connection information on one end and b) the information about the API and backend application and data accessed on the other end. Information captured includes for example User agent info, device/OS, app, IP address, API name . . . .

This embodiment is able to not only log all “end-to-end” connection information for display, reporting, analysis, but also to feed it through a set of machine learning algorithms to establish baselines in order to detect abnormal situations coming from hacking attempts.

Any combination of the above described embodiments can be used for API load balancing with API connection/access tracking and firewall capabilities for services running in cloud datacenters.

In the preceding description, numerous details are set forth. It will be apparent, however, to one of ordinary skill in the art having the benefit of this disclosure, that the present invention may be practiced without these specific details. In some instances, well-known structures and devices are shown in block diagram form, rather than in detail, in order to avoid obscuring the present invention.

It should be borne in mind, however, that all of these and similar terms are to be associated with the appropriate physical quantities and are merely convenient labels applied to these quantities. Unless specifically stated otherwise as apparent from the following discussion, it is appreciated that throughout the description, discussions utilizing terms such as “learning”, “defining”, “accessing”, or the like, refer to the actions and processes of a computer system, or similar computing device, that manipulates and transforms data represented as physical (e.g., electronic) quantities within the computer system's registers and memories into other data similarly represented as physical quantities within the computer system memories or registers or other such information storage, transmission or display devices.

The present invention also relates to an apparatus for performing the operations herein. This apparatus may be specially constructed for the required purposes, or it may comprise a general purpose computer selectively activated or reconfigured by a computer program stored in the computer. Such a computer program may be stored in a computer readable storage medium, such as, but not limited to, any type of disk including floppy disks, optical disks, CD-ROMs, and magnetic-optical disks, read-only memories (ROMs), random access memories (RAMs), EPROMs, EEPROMs, magnetic or optical cards, or any type of media suitable for storing electronic instructions.

The processes presented herein are not inherently related to any particular computer or other apparatus. Various general purpose systems may be used with programs in accordance with the teachings herein, or it may prove convenient to construct a more specialized apparatus to perform the required method steps.

It is to be understood that the above description is intended to be illustrative, and not restrictive. Many other embodiments will be apparent to those of skill in the art upon reading and understanding the above description.

The foregoing description, for purpose of explanation, has been described with reference to specific embodiments. However, the illustrative discussions above are not intended to be exhaustive or to limit the invention to the precise forms disclosed. Many modifications and variations are possible in view of the above teachings. The embodiments were chosen and described in order to best explain the principles of the invention and its practical applications, to thereby enable others skilled in the art to best utilize the invention and various embodiments with various modifications as may be suited to the particular use contemplated. 

The invention claimed is:
 1. A method, comprising: detecting, at a first proxy node from a plurality of networked proxy nodes, a synchronization event; in response to the synchronization event, synchronizing data states of the first proxy node with data states of remaining proxy nodes from the plurality of networked proxy nodes such that each proxy node from the plurality of networked proxy nodes has synchronized data states, the synchronized data states including information descriptive of a set of Application Programming Interfaces (APIs) implemented on a plurality of servers and information associated with API requests received at the plurality of networked proxy nodes; sending, from the first proxy node, a first API request to a first server from the plurality of servers based on the synchronized data states, the first server associated with a first data center; and sending, from the first proxy node and via a second proxy node from the plurality of networked proxy nodes, a second API request to a second server from the plurality of servers based on the synchronized data states, the second server being different from the first server and associated with a second data center different from the first data center.
 2. The method of claim 1, further comprising: maintaining routing information stored in a Domain Name System (DNS) server based on the synchronized data states such that the first API request is routed to the first proxy node and the second API request is routed to the second proxy node based on the routing information.
 3. The method of claim 1, wherein the synchronized data states further include: session data including information descriptive of one or more client devices communicating with the plurality of servers via at least one proxy node from the plurality of networked proxy nodes; security data associated with one or more sessions between the one or more client devices and the plurality of servers; configuration data associated with routing the API requests to the plurality of servers; and proxy node data including information descriptive of the plurality of networked proxy nodes.
 4. The method of claim 1, further comprising: in response to the second proxy node becoming inactive, modifying routing information stored in a Domain Name System (DNS) server such that a third API request received after the second API request is routed to a third proxy node from the plurality of networked proxy nodes to be sent to the second server.
 5. The method of claim 1, further comprising: sending, from the first proxy node, messages at periodic intervals to a third proxy node from the plurality of networked proxy nodes; in response to not receiving a response from the third proxy node after sending a predefined number of the messages, modifying proxy node data included in the data states of the first proxy node to: indicate that the third proxy node is inactive, or remove an identifier of the third proxy node from the proxy node data; and sending, after the modifying the proxy node data, the data states of the first proxy node to the remaining proxy nodes from the plurality of networked proxy nodes to synchronize the data states of the plurality of networked proxy nodes.
 6. The method of claim 1, wherein the synchronization event includes at least one of: a proxy node being added to the plurality of networked proxy nodes; a proxy node from the plurality of networked proxy nodes becoming active after a period of inactivity; or an expiration of a predefined period of time.
 7. The method of claim 1, further comprising: synchronizing the data states of the first proxy node with the data states of the remaining proxy nodes at periodic intervals.
 8. The method of claim 1, further comprising: receiving, via a command-line interpreter (CLI) or a RESTful API (REST API), an input to modify information included in the data states of the first proxy node; modifying the data states of the first proxy node based on the input; and sending, after the modifying the data states of the first proxy node, the data states of the first proxy node to the remaining proxy nodes from the plurality of networked proxy nodes to synchronize the data states of the plurality of networked proxy nodes.
 9. An apparatus, comprising: a memory; and a processor associated with a first proxy node from a plurality of networked proxy nodes, the processor operably coupled to the memory and configured to: detect a synchronization event; in response to the synchronization event, synchronize data states of the first proxy node with data states of remaining proxy nodes from the plurality of networked proxy nodes such that each proxy node from the plurality of networked proxy nodes has synchronized data states, the synchronized data states including information descriptive of a set of Application Programming Interfaces (APIs) implemented on a plurality of servers and information associated with API requests received at the plurality of networked proxy nodes; send a first API request to a first server from the plurality of servers based on the synchronized data states, the first server associated with a first data center; and send, via a second proxy node from the plurality of networked proxy nodes, a second API request to a second server from the plurality of servers based on the synchronized data states, the second server being different from the first server and associated with a second data center different from the first data center.
 10. The apparatus of claim 9, wherein the processor is further configured to: maintain routing information stored in a Domain Name System (DNS) server based on the synchronized data states such that the first API request is routed to the first proxy node and the second API request is routed to the second proxy node based on the routing information.
 11. The apparatus of claim 9, wherein the synchronized data states further include: session data including information descriptive of one or more client devices communicating with the plurality of servers via at least one proxy node from the plurality of networked proxy nodes; security data associated with one or more sessions between the one or more client devices and the plurality of servers; configuration data associated with routing the API requests to the plurality of servers; and proxy node data including information descriptive of the plurality of networked proxy nodes.
 12. The apparatus of claim 9, wherein the processor is further configured to: in response to the second proxy node becoming inactive, modify routing information stored in a Domain Name System (DNS) server such that a third API request received after the second API request is routed to a third proxy node from the plurality of networked proxy nodes to be sent to the second server.
 13. The apparatus of claim 9, wherein the processor is further configured to: send messages at periodic intervals to a third proxy node from the plurality of networked proxy nodes; in response to not receiving a response from the third proxy node after sending a predefined number of the messages, modify proxy node data included in the data states of the first proxy node to: indicate that the third proxy node is inactive, or remove an identifier of the third proxy node from the proxy node data; and send, after the modifying the proxy node data, the data states of the first proxy node to the remaining proxy nodes from the plurality of networked proxy nodes to synchronize the data states of the plurality of networked proxy nodes.
 14. The apparatus of claim 9, wherein the synchronization event includes at least one of: a proxy node being added to the plurality of networked proxy nodes; a proxy node from the plurality of networked proxy nodes becoming active after a period of inactivity; or an expiration of a predefined period of time.
 15. The apparatus of claim 9, wherein the processor is further configured to: synchronize the data states of the first proxy node with the data states of the remaining proxy nodes at periodic intervals.
 16. The apparatus of claim 9, wherein the processor is further configured to: receive, via a command-line interpreter (CLI) or a RESTful API (REST API), an input to modify information included in the data states of the first proxy node; modify the data states of the first proxy node based on the input; and send, after the modifying the data states of the first proxy node, the data states of the first proxy node to the remaining proxy nodes from the plurality of networked proxy nodes to synchronize the data states of the plurality of networked proxy nodes.
 17. A non-transitory processor-readable medium storing code representing instructions to be executed by a processor, the code comprising code to cause the processor to: detect, at a first proxy node from a plurality of networked proxy nodes, a synchronization event; in response to the synchronization event, synchronize data states of the first proxy node with data states of remaining proxy nodes from the plurality of networked proxy nodes such that each proxy node from the plurality of networked proxy nodes has synchronized data states, the synchronized data states including information descriptive of a set of Application Programming Interfaces (APIs) implemented on a plurality of servers and information associated with API requests received at the plurality of networked proxy nodes; send, from the first proxy node, a first API request to a first server from the plurality of servers based on the synchronized data states, the first server associated with a first data center; and send, from the first proxy node and via a second proxy node from the plurality of networked proxy nodes, a second API request to a second server from the plurality of servers based on the synchronized data states, the second server being different from the first server and associated with a second data center different from the first data center.
 18. The non-transitory processor-readable medium of claim 17, wherein the code further comprises code to cause the processor to: maintain routing information stored in a Domain Name System (DNS) server based on the synchronized data states such that the first API request is routed to the first proxy node and the second API request is routed to the second proxy node based on the routing information.
 19. The non-transitory processor-readable medium of claim 17, wherein the synchronized data states further include: session data including information descriptive of one or more client devices communicating with the plurality of servers via at least one proxy node from the plurality of networked proxy nodes; security data associated with one or more sessions between the one or more client devices and the plurality of servers; configuration data associated with routing the API requests to the plurality of servers; and proxy node data including information descriptive of the plurality of networked proxy nodes.
 20. The non-transitory processor-readable medium of claim 17, wherein the code further comprises code to cause the processor to: in response to the second proxy node becoming inactive, modify routing information stored in a Domain Name System (DNS) server such that a third API request received after the second API request is routed to a third proxy node from the plurality of networked proxy nodes to be sent to the second server. 